Abstract. Suppose that $f$ is a function on $\mathbb{R}^n$ such that $\exp(a|\cdot|^2)f$ and $\exp(b|\cdot|^2)\hat{f}$ are bounded, where $a, b > 0$. Hardy's Uncertainty Principle asserts that if $ab > \pi^2$, then $f = 0$, while if $ab = \pi^2$, then $f = c\exp(-a|\cdot|^2)$. In this paper, we generalise this uncertainty principle to vector-valued functions, and hence to operators. The principle for operators can be formulated loosely by saying that the kernel of an operator cannot be localised near the diagonal if the spectrum is also localised.



1. Introduction 
We define the Fourier transform $\hat{f}$ of a function $f$ on $\mathbb{R}^n$ by

\[ (1) \qquad \hat{f}(\xi) := \int_{\mathbb{R}^n} f(x)\exp(-2\pi i\,\xi\cdot x)\,dx \qquad \forall \xi \in \mathbb{R}^n, \]

provided that this makes sense.

G.H. Hardy [6] showed that if $f$ is a function on $\mathbb{R}^n$ such that

\[ |f(x)| \le c\exp(-a|x|^2) \qquad \forall x \in \mathbb{R}^n \]
\[ |\hat{f}(\xi)| \le c\exp(-b|\xi|^2) \qquad \forall \xi \in \mathbb{R}^n, \]

where $a, b > 0$, and if $ab > \pi^2$, then $f = 0$, while if $ab = \pi^2$, then $f(x) = c'\exp(-a|x|^2)$. Clearly the second result implies the first.

By rescaling, we may and shall suppose without loss of generality that $a = \pi$ in the above results. Similarly we may remove the constant $c$.
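
The rescaling step can be written out explicitly. The following is a sketch under the Fourier convention (1); the dilation parameter $\lambda$ and the auxiliary function $g$ are names introduced here for illustration and do not appear in the original argument.

```latex
% Assumed notation: \lambda := (\pi/a)^{1/2} and g(x) := f(\lambda x).
\begin{align*}
|g(x)| &\le c\exp(-a\lambda^2|x|^2) = c\exp(-\pi|x|^2), \\
\hat{g}(\xi) &= \int_{\mathbb{R}^n} f(\lambda x)\exp(-2\pi i\,\xi\cdot x)\,dx
             = \lambda^{-n}\hat{f}(\xi/\lambda), \\
|\hat{g}(\xi)| &\le c\lambda^{-n}\exp(-b|\xi|^2/\lambda^2)
             = c\lambda^{-n}\exp\bigl(-\tfrac{ab}{\pi}|\xi|^2\bigr).
\end{align*}
```

The dilation replaces the pair $(a, b)$ by $(\pi, ab/\pi)$ and leaves the product $ab$ unchanged, so the critical case $ab = \pi^2$ becomes the symmetric case in which both exponents equal $\pi$.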

For a detailed account of the proof and some classical related results, see H. Dym 
and H.P. McKean [5]. 

This result, which we call Hardy's Uncertainty Principle, has been generalised in many ways. For instance, G.W. Morgan [7] obtained a sharp version involving the modified exponentials $\exp(-a|\cdot|^p)$ and $\exp(-b|\cdot|^q)$, where $1/p + 1/q = 1$, giving the extremal functions. Cowling and J.F. Price [3] showed that if $p$ or $q$ is finite, and

\[ \exp(\pi|\cdot|^2)f \in L^p(\mathbb{R}^n) \]
\[ \exp(\pi|\cdot|^2)\hat{f} \in L^q(\mathbb{R}^n), \]

then $f = 0$.

Demange (see A. Bonami and Demange [1]) proved the best version for the usual quadratic exponentials.



Date: May 6, 2008. 






M. G. COWLING, B. DEMANGE, AND M. SUNDARI 



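Theorem 1. Suppose that $\Phi \in \mathcal{S}(\mathbb{R}^n)'$ (that is, $\Phi$ is a tempered distribution) and

\[ \exp(\pi|\cdot|^2)\Phi \in \mathcal{S}(\mathbb{R}^n)' \]
\[ \exp(\pi|\cdot|^2)\hat{\Phi} \in \mathcal{S}(\mathbb{R}^n)', \]

then

\[ \Phi(x) = \sum_{|\alpha| \le N} c_\alpha x^\alpha \exp(-\pi|x|^2) \qquad \forall x \in \mathbb{R}^n, \]

that is, $\Phi$ is the distribution arising by integration against the function on the right hand side.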

There has been interest recently in developing our understanding of heat diffusion and its relationship with uncertainty principles into more general contexts than $\mathbb{R}^n$, such as the Heisenberg group, where the work of Thangavelu [5] shows that the subelliptic world presents some surprises, Lie groups [5], where structure plays an important role, and differential operators on measured metric spaces [2].

In this paper, we extend the uncertainty principles of Hardy and of Demange to vector-valued functions. The former generalises naturally to Banach space valued functions, while the latter can be formulated for functions with values in the dual of a Fréchet space.

We then consider uncertainty principles for operators. Our main theorems are concerned with operators from $\mathcal{S}(\mathbb{R}^n)$ to $\mathcal{S}(\mathbb{R}^n)'$, where the kernel of the operator and one of its partial Fourier transforms satisfy conditions like those of Theorem 1, and with operators on $L^2(\mathbb{R}^n)$, whose kernels are locally integrable and which satisfy inequalities like those of Hardy's theorem. We extend and sharpen previous work of Cowling and Sundari [4], which led us to conjecture the result proved here. All the results for operators are stated and proved in Section 3.

Here is one of our main results, in which $P_t$ is the heat operator and $p_t$ is its kernel.

Theorem 2. Suppose that $t$ is in $\mathbb{R}^+$, that $K$ is the operator on $L^2(\mathbb{R}^n)$ associated to a locally integrable kernel $k$, and that

\[ |k(x,y)| \le p_t(x,y) \qquad \forall x, y \in \mathbb{R}^n \]
\[ \|Kf\|_2 \le \|P_t f\|_2 \qquad \forall f \in L^2(\mathbb{R}^n). \]

Then there is a bounded measurable function $m$ on $\mathbb{R}^n$ such that

\[ k(x,y) = m(x)\,p_t(x,y) \qquad \forall x, y \in \mathbb{R}^n, \]

and the operator $K$ is the heat operator $P_t$ followed by multiplication by $m$.

Our other main result, Theorem 5, generalises Theorem 2 in the same way that Theorem 1 generalises Hardy's original result.

In the next section, we summarise L. Schwartz' theory of distributions, including the case of vector-valued distributions, and establish versions of Hardy's Uncertainty Principle in this context. In Section 3, we consider the theory for linear operators and prove Theorems 2 and 5.

2. Tempered distributions 

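2.1. The Schwartz space. Recall that the Schwartz space $\mathcal{S}(\mathbb{R}^n)$ is defined to be the space of all smooth functions $f : \mathbb{R}^n \to \mathbb{C}$ such that $\|f\|_{(N)} < \infty$ for all $N$ in $\mathbb{N}$, where

\[ (2) \qquad \|f\|_{(N)} := \Bigl( \sum_{|\alpha|, |\beta| \le N} \bigl\| \cdot^\alpha D^\beta f \bigr\|_2^2 \Bigr)^{1/2}. \]

Here we use standard multi-index notation. Thus, when $\alpha \in \mathbb{N}^n$, we define $x^\alpha$ to be $x_1^{\alpha_1} \cdots x_n^{\alpha_n}$, and

\[ D^\alpha f(x) := \frac{\partial^{|\alpha|}}{\partial x_1^{\alpha_1} \cdots \partial x_n^{\alpha_n}} f(x). \]

Then $\mathcal{S}(\mathbb{R}^n)$ is the intersection of the completions $\mathcal{S}_{(N)}(\mathbb{R}^n)$ of the spaces of compactly supported smooth functions in the norm $\|\cdot\|_{(N)}$. It is a Fréchet space: it has a metric, for instance,

\[ d(f,g) := \sum_{N \in \mathbb{N}} \frac{\|f-g\|_{(N)}}{2^N (1 + \|f-g\|_{(N)})}, \]

but in general $d(\lambda f, \lambda g) \ne |\lambda|\, d(f,g)$.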
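
The failure of homogeneity can be seen from a one-line bound. This is a sketch assuming the metric has the standard form $d(f,g) = \sum_{N \ge 0} 2^{-N} s_N/(1+s_N)$ with $s_N = \|f-g\|_{(N)}$, which is the form of the Fréchet metric used later in the paper; the abbreviation $s_N$ is introduced here.

```latex
% Each summand is strictly less than 2^{-N}, so the metric is bounded:
\[
d(f,g) = \sum_{N \ge 0} \frac{1}{2^N}\,\frac{s_N}{1+s_N}
       < \sum_{N \ge 0} \frac{1}{2^N} = 2 .
\]
% Hence, for f \ne g and |\lambda| \ge 2 / d(f,g):
\[
d(\lambda f, \lambda g) < 2 \le |\lambda|\, d(f,g).
\]
```

So the metric is translation invariant but cannot scale linearly under dilations of its arguments.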

2.2. Tempered distributions. The space of tempered distributions is the dual of $\mathcal{S}(\mathbb{R}^n)$. It is the union of the spaces $\mathcal{S}_{(N)}(\mathbb{R}^n)'$. Thus, any tempered distribution $\Phi$ belongs to some space $\mathcal{S}_{(N)}(\mathbb{R}^n)'$, and then

\[ |\Phi(f)| \le c\,\|f\|_{(N)} \qquad \forall f \in \mathcal{S}(\mathbb{R}^n). \]

Every slowly growing locally integrable function $k$, that is, a locally integrable function whose total mass over a ball grows polynomially with the radius of the ball, defines a distribution $\Phi_k$ by integration:

\[ \Phi_k(f) := \int_{\mathbb{R}^n} k(x) f(x)\,dx. \]

A statement of the form $\Phi = f$, where $\Phi$ is a distribution and $f$ is a function, is to be interpreted as saying that $\Phi$ is the distribution obtained by integrating against $f$. For many purposes, we may consider all distributions as being given by integration against a function, as they may be derived from such distributions by standard analytical processes, such as taking limits or differentiating.

Sometimes we write a formula involving distributions using pointwise notation, as shorthand for the integrated version of the formula. Thus, in $\mathbb{R}^{2n}$, we will write

\[ \Phi(x,y) = M(x)\,g(x-y) \qquad \forall x, y \in \mathbb{R}^n, \]

where $g \in \mathcal{S}(\mathbb{R}^n)$ and $M \in \mathcal{S}(\mathbb{R}^n)'$. This "means" that

\[ \Phi(f) \text{ ``=''} \int_{\mathbb{R}^n} \int_{\mathbb{R}^n} M(x)\,g(x-y)\,f(x,y)\,dy\,dx = \int_{\mathbb{R}^n} M(x) \int_{\mathbb{R}^n} g(x-y)\,f(x,y)\,dy\,dx \]

for all $f$ in $\mathcal{S}(\mathbb{R}^{2n})$, and so really means that applying the distribution $\Phi$ to $f$ is the same as applying the distribution $M$ to the function $x \mapsto \int_{\mathbb{R}^n} g(x-y)\,f(x,y)\,dy$.

Every smooth function $h$, all of whose partial derivatives are of polynomial growth, multiplies $\mathcal{S}(\mathbb{R}^n)$ pointwise, and hence multiplies $\mathcal{S}(\mathbb{R}^n)'$ by duality: for $\Phi$ in $\mathcal{S}(\mathbb{R}^n)'$,

\[ h\Phi(f) := \Phi(hf) \qquad \forall f \in \mathcal{S}(\mathbb{R}^n). \]
In particular, we write $E_{i,1}$ for the function $(x,y) \mapsto \exp(2\pi i\, x \cdot y)$ on $\mathbb{R}^{2n}$. Then $E_{i,1}$ multiplies $\mathcal{S}(\mathbb{R}^{2n})$ and hence $\mathcal{S}(\mathbb{R}^{2n})'$.

For $f$, $g$ in $\mathcal{S}(\mathbb{R}^n)$, we define $f \otimes g$ in $\mathcal{S}(\mathbb{R}^{2n})$ by

\[ f \otimes g\,(x,y) := f(x)\,g(y) \qquad \forall x, y \in \mathbb{R}^n. \]

The set of finite linear combinations of such "outer tensor product functions" is dense in $\mathcal{S}(\mathbb{R}^{2n})$.

The Schwartz kernel theorem states that there is a one-to-one correspondence between continuous linear operators $T$ from $\mathcal{S}(\mathbb{R}^n)$ to $\mathcal{S}(\mathbb{R}^n)'$ and tempered distributions $\Phi$ in $\mathcal{S}(\mathbb{R}^{2n})'$, described by the formula

\[ T(f)(g) = \Phi(g \otimes f) \qquad \forall f, g \in \mathcal{S}(\mathbb{R}^n). \]

At least formally, we may write $T(f)(x) = \int_{\mathbb{R}^n} \Phi(x,y)\,f(y)\,dy$.

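2.3. The Fourier transformation. The Fourier transformation $\mathcal{F} : f \mapsto \hat{f}$ (where $\hat{f}$ is defined by (1)) is a bijection of $\mathcal{S}(\mathbb{R}^n)$. Generalising the formula

\[ \int_{\mathbb{R}^n} \hat{f}(x)\,g(x)\,dx = \int_{\mathbb{R}^n} f(\xi)\,\hat{g}(\xi)\,d\xi \qquad \forall f, g \in L^1(\mathbb{R}^n), \]

we define the Fourier transform $\hat{\Phi}$ of a tempered distribution $\Phi$ by

\[ \hat{\Phi}(f) := \Phi(\hat{f}) \qquad \forall f \in \mathcal{S}(\mathbb{R}^n). \]

This extended Fourier transformation is also bijective on $\mathcal{S}(\mathbb{R}^n)'$.

Suppose that $f$ is a function on $\mathbb{R}^{2n}$. The partial Fourier transform $\mathcal{F}_2 f$ of $f$ is defined by

\[ \mathcal{F}_2 f(x,\eta) := \int_{\mathbb{R}^n} f(x,y)\exp(-2\pi i\,\eta \cdot y)\,dy \qquad \forall x, \eta \in \mathbb{R}^n, \]

provided that this makes sense. It is easy to show that the partial Fourier transformation $\mathcal{F}_2$ is a bijection of $\mathcal{S}(\mathbb{R}^{2n})$, and hence extends to $\mathcal{S}(\mathbb{R}^{2n})'$.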

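2.4. Hardy's Uncertainty Principle for Distributions. As already mentioned, Demange (see [1]) proved the best version of Hardy's result for the usual quadratic exponentials: if $\Phi \in \mathcal{S}(\mathbb{R}^n)'$ and

\[ \exp(\pi|\cdot|^2)\Phi \in \mathcal{S}_{(N)}(\mathbb{R}^n)' \]
\[ \exp(\pi|\cdot|^2)\hat{\Phi} \in \mathcal{S}_{(N)}(\mathbb{R}^n)', \]

then

\[ \Phi(x) = \sum_{|\alpha| \le N'} c_\alpha x^\alpha \exp(-\pi|x|^2) \qquad \forall x \in \mathbb{R}^n. \]

We claim that we may assume that $N' \le N$. To see this, we may suppose that $c_\alpha \ne 0$ for some $\alpha$ such that $|\alpha| = N'$, in which case, for some $y_0$ in $\mathbb{R}^n$, there exists a positive constant $c$ such that

\[ \Bigl| \sum_{|\alpha| = N'} c_\alpha (t y_0)^\alpha \Bigr| = c\,t^{N'} \qquad \forall t \in \mathbb{R}^+. \]

Now if $\Phi$ satisfies the hypotheses and hence also the conclusion of the theorem, then

\[ \Bigl| \sum_{|\alpha| \le N'} c_\alpha \int_{\mathbb{R}^n} x^\alpha f(x)\,dx \Bigr| = \bigl|\Phi(\exp(\pi|\cdot|^2) f)\bigr| = \bigl|(\exp(\pi|\cdot|^2)\Phi)(f)\bigr| \le c\,\|f\|_{(N)} \qquad \forall f \in \mathcal{S}(\mathbb{R}^n). \]

Take a smooth function $f$ with small compact support such that $\int_{\mathbb{R}^n} f(x)\,dx = 1$. Then, on the one hand,

\[ \|\tau_y f\|_{(N)} \le c\,(1 + |y|)^N, \]

where $\tau_y f$ denotes the translate $f(\cdot - y)$ of $f$, while on the other,

\[ \Bigl| \sum_{|\alpha| \le N'} c_\alpha \int_{\mathbb{R}^n} x^\alpha \tau_y f(x)\,dx \Bigr| = \Bigl| \sum_{|\alpha| \le N'} c_\alpha \int_{\mathbb{R}^n} (x+y)^\alpha f(x)\,dx \Bigr| = \Bigl| \sum_{|\alpha| = N'} c_\alpha y^\alpha \Bigr| + O(|y|^{N'-1}), \]

and the claim follows by taking $y$ to be $t y_0$ and letting $t$ grow.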

2.5. Vector-valued distributions. Take a Banach space $X$.

For nice enough $X$-valued functions $f$, we may still compute partial derivatives and multiply by scalar-valued functions. Hence we may form the vector-valued Schwartz space $\mathcal{S}(\mathbb{R}^n; X)$ of $X$-valued functions $f$ such that $\|f\|_{(N)} < \infty$ for all $N$ in $\mathbb{N}$, where the norm $\|\cdot\|_{(N)}$ is still defined by (2).

It is then possible to define the Fourier transform $\hat{f}$ of such a function, by formula (1), where the integrand is $X$-valued, and to prove an $X$-valued version of Hardy's Uncertainty Principle.

Theorem 3. Suppose that $f : \mathbb{R}^n \to X$ is a smooth vector-valued function such that

\[ \|f(x)\|_X \le \exp(-\pi|x|^2) \qquad \forall x \in \mathbb{R}^n \]
\[ \|\hat{f}(\xi)\|_X \le \exp(-\pi|\xi|^2) \qquad \forall \xi \in \mathbb{R}^n. \]

Then $f(x) = f(0)\exp(-\pi|x|^2)$.

Proof. Take an element $V$ of the dual space $X'$, and denote by $V(f)$ the scalar-valued function $x \mapsto V(f(x))$. Clearly this satisfies the inequalities

\[ |V(f)(x)| \le c\exp(-\pi|x|^2) \qquad \forall x \in \mathbb{R}^n \]
\[ |(V(f))\hat{\ }(\xi)| \le c\exp(-\pi|\xi|^2) \qquad \forall \xi \in \mathbb{R}^n, \]

so $V(f)(x) = c'\exp(-\pi|x|^2)$ for all $x$ in $\mathbb{R}^n$.

Evidently $c'$ depends linearly on $V$, and so there is an element $C$ of $X''$ such that $c' = C(V)$. Now, by taking $x$ to be $0$, we see that $C(V) = V(f(0))$. □

Take a Fréchet space $X$, which is the intersection of a decreasing family of Banach spaces $X_{(M)}$ with increasing norms $\|\cdot\|_{(M)}$. The standard Fréchet metric $d$ is defined by

\[ d(u,v) := \sum_{M \in \mathbb{N}} \frac{\|u-v\|_{(M)}}{2^M (1 + \|u-v\|_{(M)})}. \]






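Suppose that $f$ is $X$-valued and continuous, and that for some $M$ in $\mathbb{N}$,

\[ \|f(y)\|_{(M)} \le \exp(-\pi|y|^2) \qquad \forall y \in \mathbb{R}^n \]
\[ \|\hat{f}(\eta)\|_{(M)} \le \exp(-\pi|\eta|^2) \qquad \forall \eta \in \mathbb{R}^n. \]

Then $f(y) = f(0)\exp(-\pi|y|^2)$, by Theorem 3.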

If $d$ is the Fréchet metric just defined, and $f : \mathbb{R}^n \to X$ is continuous and

\[ d(f(y), 0) \le c\exp(-\pi|y|^2) \qquad \forall y \in \mathbb{R}^n \]
\[ d(\hat{f}(\eta), 0) \le c\exp(-\pi|\eta|^2) \qquad \forall \eta \in \mathbb{R}^n, \]

then the same conclusion holds. Indeed, assuming (as we may) that $c \ge 1$, then these estimates imply that

\[ \frac{\|f(y)\|_{(0)}}{1 + \|f(y)\|_{(0)}} \le c\exp(-\pi|y|^2) \qquad \forall y \in \mathbb{R}^n \]
\[ \frac{\|\hat{f}(\eta)\|_{(0)}}{1 + \|\hat{f}(\eta)\|_{(0)}} \le c\exp(-\pi|\eta|^2) \qquad \forall \eta \in \mathbb{R}^n, \]

and hence that, when $y$ and $\eta$ are big enough so that $c\exp(-\pi|y|^2) \le 1/2$ and $c\exp(-\pi|\eta|^2) \le 1/2$,

\[ \|f(y)\|_{(0)} \le 2c\exp(-\pi|y|^2) \]
\[ \|\hat{f}(\eta)\|_{(0)} \le 2c\exp(-\pi|\eta|^2). \]
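
The passage from the bound on the quotient to the bound on the norm is elementary; the following sketch spells it out. The symbols $s$ and $\varepsilon$ are shorthand introduced here for $\|f(y)\|_{(0)}$ and $c\exp(-\pi|y|^2)$ respectively.

```latex
% s := \|f(y)\|_{(0)} \ge 0 and \varepsilon := c\exp(-\pi|y|^2) are names introduced for this sketch.
\[
\frac{s}{1+s} \le \varepsilon \le \tfrac12
\;\Longrightarrow\; s\,(1-\varepsilon) \le \varepsilon
\;\Longrightarrow\; s \le \frac{\varepsilon}{1-\varepsilon} \le 2\varepsilon .
\]
% The smallness condition on \varepsilon is a condition on |y| alone:
\[
c\exp(-\pi|y|^2) \le \tfrac12
\iff |y|^2 \ge \frac{\log(2c)}{\pi}.
\]
```

The same computation applies to $\hat{f}(\eta)$ with $\eta$ in place of $y$.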

It follows that $y \mapsto V'(f(y))$ is a continuous function on $\mathbb{R}^n$ for any $V'$ in $X_{(0)}'$, and that

\[ |V'(f(y))| \le 2c\,\|V'\|_{(0)}\exp(-\pi|y|^2) \qquad \text{when } |y| \ge \kappa \]
\[ |V'(\hat{f}(\eta))| \le 2c\,\|V'\|_{(0)}\exp(-\pi|\eta|^2) \qquad \text{when } |\eta| \ge \kappa, \]

where $\kappa = (\log(2c)/\pi)^{1/2}$. Thus, $\exp(\pi|\cdot|^2)V'(f(\cdot))$ and $\exp(\pi|\cdot|^2)V'(\hat{f}(\cdot))$ are distributions, and by Theorem 1,

\[ V'(f(y)) = c(V')\exp(-\pi|y|^2) \qquad \forall y \in \mathbb{R}^n, \]

where $c(V')$ is a constant depending on $V'$. But $c(V') = V'(f(0))$ for all $V'$ and hence

\[ f(y) = f(0)\exp(-\pi|y|^2) \qquad \forall y \in \mathbb{R}^n. \]

Note that there are metrics (e.g., $d^{1/2}$) so that Hardy's Uncertainty Principle cannot be formulated as we have formulated the result for $d$. Thus this result is more a curiosity than "a theorem from the book".



We may generalise further, and consider distributions with values in $X'$, the dual of the Fréchet space $X$. At least formally, every element $\Phi$ of $\mathcal{S}(\mathbb{R}^n; X)'$ is such an object: for any such $\Phi$ there exist $M$ and $N$ in $\mathbb{N}$ such that

\[ |\Phi(f)| \le c\,\|f\|_{(M,N)} \qquad \forall f \in \mathcal{S}(\mathbb{R}^n; X), \]

where

\[ \|f\|_{(M,N)} := \Bigl( \sum_{|\alpha|, |\beta| \le N} \int_{\mathbb{R}^n} \bigl\| x^\alpha D^\beta f(x) \bigr\|_{(M)}^2\,dx \Bigr)^{1/2}. \]

Given an $X'$-valued distribution $\Phi$ and an element $V$ of $X$, we may define a scalar-valued distribution $\Phi(V)$ by the formula

\[ \Phi(V)(f) := \Phi(fV) \qquad \forall f \in \mathcal{S}(\mathbb{R}^n), \]

where $fV$ is the function whose value at $x$ is $f(x)V$. Thus defined, $\Phi(V)$ is a distribution, because for $V$ in $X$ and a scalar-valued function $f$ in $\mathcal{S}(\mathbb{R}^n)$, the vector-valued function $fV$ is in $\mathcal{S}(\mathbb{R}^n; X)$.

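2.6. A vector-valued version of Theorem 1.

Theorem 4. Suppose that $t$ is in $\mathbb{R}^+$, that $X$ is a Fréchet space, that $\Phi$ is an $X'$-valued distribution, and that

\[ \exp(t\pi|\cdot|^2)\Phi \in \mathcal{S}(\mathbb{R}^n; X)' \]
\[ \exp(\tfrac{\pi}{t}|\cdot|^2)\hat{\Phi} \in \mathcal{S}(\mathbb{R}^n; X)'. \]

Then

\[ \Phi(y) = \sum_{|\alpha| \le N} C_\alpha\, y^\alpha \exp(-t\pi|y|^2) \qquad \forall y \in \mathbb{R}^n, \]

where $C_\alpha \in X'$.

In particular, this applies when $X = \mathcal{S}(\mathbb{R}^n)$.

Proof. By rescaling if necessary, we may suppose without loss of generality that $t = 1$.

As already remarked, there exist $M$ and $N$ in $\mathbb{N}$ such that

\[ \exp(\pi|\cdot|^2)\Phi \in \mathcal{S}_{(M,N)}(\mathbb{R}^n; X)' \]
\[ \exp(\pi|\cdot|^2)\hat{\Phi} \in \mathcal{S}_{(M,N)}(\mathbb{R}^n; X)'. \]

For $V$ in $X$, it follows that

\[ \exp(\pi|\cdot|^2)\Phi(V) \in \mathcal{S}_{(N)}(\mathbb{R}^n)' \]
\[ \exp(\pi|\cdot|^2)\hat{\Phi}(V) \in \mathcal{S}_{(N)}(\mathbb{R}^n)', \]

and by Theorem 1,

\[ \Phi(V)(y) = \sum_{|\alpha| \le N} c_\alpha(V)\, y^\alpha \exp(-\pi|y|^2) \qquad \forall y \in \mathbb{R}^n. \]

We now determine how the numbers $c_\alpha(V)$ depend on $V$.

The functions $y \mapsto y^\alpha \exp(-\pi|y|^2)$ are linearly independent for different $\alpha$, and we may find Schwartz functions $f_\beta$ such that

\[ \int_{\mathbb{R}^n} y^\alpha \exp(-\pi|y|^2)\, f_\beta(y)\,dy = \delta_{\alpha,\beta} \]

(the Kronecker delta) for all $\alpha$ and $\beta$ such that $|\alpha|, |\beta| \le N$. Now $c_\beta(V)$ is equal to $\Phi(V)(f_\beta)$, and hence is linear in $V$, and

\[ |c_\beta(V)| \le c(\Phi)\,\|V\|_{(M)}. \]

Thus there exists $C_\beta$ in $X_{(M)}'$ such that $c_\beta(V) = C_\beta(V)$. □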
3. Operators

We consider operators $K$ on $L^2(\mathbb{R}^n)$ defined by locally integrable kernels $k$, that is,

\[ Kf(x) = \int_{\mathbb{R}^n} k(x,y)\,f(y)\,dy \qquad \forall x \in \mathbb{R}^n \]

for all $f$ in $L^2(\mathbb{R}^n)$. Such operators may be given different orderings. We write $|K_1| \le |K_2|$ if

\[ (3) \qquad \|K_1 f\|_2 \le \|K_2 f\|_2 \qquad \forall f \in L^2(\mathbb{R}^n) \]

(this means that $K_1^* K_1 \le K_2^* K_2$ in the usual ordering of self-adjoint operators) and $|k_1| \le |k_2|$ if

\[ |k_1(x,y)| \le |k_2(x,y)| \qquad \forall x, y \in \mathbb{R}^n. \]

We may omit some of the absolute value signs if one of the operators is positive, or if one of the kernels is positive. Note that if $K_2$ has an (unbounded) inverse $K_2^{-1}$ with a dense domain, then the inequality (3) amounts to saying that $K_1 K_2^{-1}$ extends to a bounded operator of norm at most one.
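
Both reformulations of the operator inequality (3) follow from short computations. The sketch below writes $\langle\cdot,\cdot\rangle$ for the inner product of $L^2(\mathbb{R}^n)$ and $g$ for an element of the range of $K_2$; both pieces of notation are introduced here for illustration.

```latex
% Quadratic-form version: for every f in L^2(R^n),
\[
\langle K_1^* K_1 f, f \rangle = \|K_1 f\|_2^2
\le \|K_2 f\|_2^2 = \langle K_2^* K_2 f, f \rangle .
\]
% Bounded-quotient version: if g = K_2 f lies in the (dense) domain of K_2^{-1}, then
\[
\|K_1 K_2^{-1} g\|_2 = \|K_1 f\|_2 \le \|K_2 f\|_2 = \|g\|_2 ,
\]
% so K_1 K_2^{-1} is bounded by 1 on a dense subspace and extends by continuity.
```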

3.1. The heat operator. The heat semigroup is an important family of operators on $L^2(\mathbb{R}^n)$. For positive $t$, we define the heat kernel $p_t$ (often described as "a Gaussian") by

\[ p_t(x,y) := \frac{1}{t^{n/2}} \exp\Bigl( -\frac{\pi|x-y|^2}{t} \Bigr) \qquad \forall x, y \in \mathbb{R}^n. \]

We then define the heat operator $P_t$ to be the operator corresponding to this kernel. Then $\mathcal{F} P_t f = \exp(-t\pi|\cdot|^2)\,\mathcal{F} f$.

We define the Laplacian $\Delta$ as a positive operator:

\[ \Delta := -\frac{1}{4\pi} \sum_{j=1}^{n} \frac{\partial^2}{\partial x_j^2}, \]

so $\mathcal{F}(\Delta f) = \pi|\cdot|^2 \hat{f}$, and $P_t$ may be written as $\exp(-t\Delta)$. The operator $P_t$ has an unbounded inverse $\exp(t\Delta)$ with dense domain.
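
The two Fourier identities of this subsection can be checked directly. The sketch below assumes the normalisation $p_t(x,y) = t^{-n/2}\exp(-\pi|x-y|^2/t)$, which is the one compatible with the stated multiplier $\exp(-t\pi|\cdot|^2)$ under convention (1), and uses the standard fact that $\exp(-\pi|\cdot|^2)$ is its own Fourier transform. The name $q_t$ for the convolution kernel is introduced here.

```latex
% q_t(x) := t^{-n/2}\exp(-\pi|x|^2/t), so that p_t(x,y) = q_t(x-y) and P_t f = q_t * f.
\begin{align*}
\hat{q}_t(\xi)
  &= t^{-n/2}\int_{\mathbb{R}^n} \exp(-\pi|x|^2/t)\exp(-2\pi i\,\xi\cdot x)\,dx \\
  &= \int_{\mathbb{R}^n} \exp(-\pi|u|^2)\exp(-2\pi i\,(t^{1/2}\xi)\cdot u)\,du
     \qquad (x = t^{1/2}u) \\
  &= \exp(-t\pi|\xi|^2), \\
\mathcal{F}(P_t f)(\xi) &= \hat{q}_t(\xi)\,\hat{f}(\xi) = \exp(-t\pi|\xi|^2)\,\hat{f}(\xi), \\
\mathcal{F}(\Delta f)(\xi)
  &= -\frac{1}{4\pi}\sum_{j=1}^{n} (2\pi i\,\xi_j)^2\,\hat{f}(\xi)
   = \pi|\xi|^2\,\hat{f}(\xi).
\end{align*}
```

Comparing the last two lines gives $P_t = \exp(-t\Delta)$ as a Fourier multiplier.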

3.2. The main results. Implicitly, we use three quadratic forms $B_1$, $B_2$ and $B_3$ on $\mathbb{R}^n \times \mathbb{R}^n$:

\[ B_1(x,y) = 2x \cdot y \qquad B_2(x,y) = |y|^2 \qquad B_3(x,y) = |x-y|^2 \]

for all $x$ and $y$ in $\mathbb{R}^n$. Recall that we define $E_{i,1}(x,y) = \exp(2\pi i\, x \cdot y)$. We also write $E^t_{r,2}$ and $E^t_{r,3}$ for the exponential functions $(x,y) \mapsto \exp(t\pi|y|^2)$ and $(x,y) \mapsto \exp(t\pi|x-y|^2)$ on $\mathbb{R}^{2n}$. The subscripts $r$ and $i$ stand for real and imaginary, and the subscripts 1, 2 and 3 describe the quadratic form involved. The superscript $t$ is used to indicate the real number $t$ in the exponent.

We are moving towards proving Theorem 2. According to our discussions above, this is a theorem about operators $K$ and their locally integrable kernels $k$, such that $K\exp(t\Delta)$ is bounded on $L^2(\mathbb{R}^n)$ and $E^{1/t}_{r,3} k \in L^\infty(\mathbb{R}^{2n})$.

The following version of Hardy's theorem for operators boils down to Theorem 1 if the operator $T$ from $\mathcal{S}(\mathbb{R}^n)$ to $\mathcal{S}(\mathbb{R}^n)'$ is given by a convolution, but otherwise it is more general.
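Theorem 5. Suppose that $t$ is in $\mathbb{R}^+$, and that $T$ is an operator from $\mathcal{S}(\mathbb{R}^n)$ to $\mathcal{S}(\mathbb{R}^n)'$ such that $T\exp(t\Delta)$ also maps $\mathcal{S}(\mathbb{R}^n)$ into $\mathcal{S}(\mathbb{R}^n)'$ and the kernel $\Phi$ in $\mathcal{S}(\mathbb{R}^{2n})'$ of $T$ (given by the Schwartz kernel theorem) satisfies $E^{1/t}_{r,3}\Phi \in \mathcal{S}(\mathbb{R}^{2n})'$. Then there exist a positive integer $N$ and distributions $M_\alpha$ in $\mathcal{S}(\mathbb{R}^n)'$ when $|\alpha| \le N$ such that

\[ \Phi(x,y) = \sum_{|\alpha| \le N} M_\alpha(x)\,(x-y)^\alpha\, p_t(x,y) \qquad \forall x, y \in \mathbb{R}^n. \]

Proof. By rescaling, we may suppose that $t = 1$. By our hypotheses,

\[ E^1_{r,3}\Phi \in \mathcal{S}(\mathbb{R}^{2n})' \]
\[ E^1_{r,2}\mathcal{F}_2\Phi \in \mathcal{S}(\mathbb{R}^{2n})'. \]

Define the distribution $\Psi$ by

\[ \Psi(x,y) = \Phi(x, x-y) \qquad \forall x, y \in \mathbb{R}^n. \]

Then $E^1_{r,2}\Psi(x,y) = E^1_{r,3}\Phi(x, x-y) \in \mathcal{S}(\mathbb{R}^{2n})'$, while

\[ E^1_{r,2}\mathcal{F}_2\Psi(x,\eta) = \exp(\pi|\eta|^2) \int_{\mathbb{R}^n} \exp(-2\pi i\,\eta \cdot y)\,\Psi(x,y)\,dy \]
\[ = \exp(\pi|\eta|^2) \int_{\mathbb{R}^n} \exp(-2\pi i\,\eta \cdot y)\,\Phi(x, x-y)\,dy \]
\[ = \exp(\pi|\eta|^2)\,(E_{i,1}\mathcal{F}_2\Phi)(x, -\eta). \]

By the earlier result about multiplication by the function $E_{i,1}$ and the hypothesis, $E^1_{r,2}\mathcal{F}_2\Psi \in \mathcal{S}(\mathbb{R}^{2n})'$. The conclusion now follows from Theorem 4 with $X$ taken to be $\mathcal{S}(\mathbb{R}^n)$. □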
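
The change of variables behind the partial Fourier transform of $\Psi(x,y) = \Phi(x, x-y)$ can be written out as follows. This is a formal sketch that treats $\Phi$ as a function; the substitution variable $z$ is introduced here.

```latex
% Substitute z = x - y, so that y = x - z and dy = dz.
\begin{align*}
\int_{\mathbb{R}^n} \exp(-2\pi i\,\eta\cdot y)\,\Phi(x, x-y)\,dy
  &= \int_{\mathbb{R}^n} \exp(-2\pi i\,\eta\cdot(x-z))\,\Phi(x,z)\,dz \\
  &= \exp(-2\pi i\,x\cdot\eta) \int_{\mathbb{R}^n} \exp(2\pi i\,\eta\cdot z)\,\Phi(x,z)\,dz \\
  &= \exp(2\pi i\,x\cdot(-\eta))\;\mathcal{F}_2\Phi(x,-\eta).
\end{align*}
```

The prefactor is the function $(x,\eta) \mapsto \exp(2\pi i\, x\cdot\eta)$ evaluated at $(x,-\eta)$, which has modulus one and polynomially bounded derivatives, so multiplying by it preserves tempered distributions.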

We are now able to prove Theorem 2, which we recall for the reader's convenience: suppose that $t$ is in $\mathbb{R}^+$, that $K$ is the operator on $L^2(\mathbb{R}^n)$ defined by a locally integrable kernel $k$, and that

\[ |k| \le p_t \]
\[ |K| \le P_t. \]

Then there exists a bounded measurable function $m$ on $\mathbb{R}^n$ such that

\[ k(x,y) = m(x)\,p_t(x,y) \qquad \forall x, y \in \mathbb{R}^n. \]

If $m$ is a bounded measurable function on $\mathbb{R}^n$ and $k(x,y) = m(x)\,p_t(x,y)$, then $K$ is the heat operator $P_t$ followed by multiplication by $m$; the two inequalities of the theorem hold, but clearly $k$ may be more general than a Gaussian.

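Proof of Theorem 2. By rescaling if necessary, we may suppose that $t = 1$.

For a nonnegative integer $j$, write $j'$ for $\max\{0, j-1\}$. For $f$ and $g$ in $\mathcal{S}(\mathbb{R}^n)$,

\[ |\langle f, K\Delta^j g \rangle| \le \|f\|_2\,\|K\Delta^j g\|_2 \]
\[ \le c\,\|f\|_2\,\|P_1\Delta^j g\|_2 \]
\[ = c\,\|f\|_2\,\|(P_1\Delta^j g)\hat{\ }\|_2 \]
\[ \le c\,\|f\|_2 \sup\{\exp(-\pi|\xi|^2)\,(\pi|\xi|^2)^{j'} : \xi \in \mathbb{R}^n\}\;\|\Delta^{j-j'} g\|_2 \]
\[ \le c\,\Bigl(\frac{j'}{e}\Bigr)^{j'} \|f\|_{(0)}\,\|g\|_{(2)} \]

(when $j' = 0$, the right hand side is interpreted as $c\,\|f\|_{(0)}\,\|g\|_{(2)}$).

Now

\[ |\langle f, K\exp(\Delta) g \rangle| \le \sum_{j \in \mathbb{N}} \frac{1}{j!}\,|\langle f, K\Delta^j g \rangle| \]
\[ \le c\,\|f\|_{(0)}\,\|g\|_{(2)} \qquad \forall f, g \in \mathcal{S}(\mathbb{R}^n). \]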
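
The summation over $j$ converges because the supremum of the Gaussian against a power grows no faster than a factorial. The following sketch assumes that the supremum in the preceding estimate is that of $\exp(-\pi|\xi|^2)(\pi|\xi|^2)^{j'}$ with $j' = \max\{0, j-1\}$, as the surrounding text indicates; the variables $s$ and $m$ are introduced here, and the last step uses the Stirling lower bound $j! \ge \sqrt{2\pi j}\,(j/e)^j$.

```latex
% Elementary calculus: for an integer m \ge 1, s \mapsto e^{-s}s^m is maximised at s = m.
\[
\sup_{s \ge 0} e^{-s}s^{m} = \Bigl(\frac{m}{e}\Bigr)^{m}.
\]
% With m = j - 1 and j \ge 2, Stirling's lower bound for j! gives
\[
\frac{1}{j!}\Bigl(\frac{j-1}{e}\Bigr)^{j-1}
\le \frac{e}{j\sqrt{2\pi j}}\Bigl(1-\frac1j\Bigr)^{j-1}
\le \frac{e}{\sqrt{2\pi}}\,j^{-3/2},
\]
% and \sum_{j \ge 2} j^{-3/2} < \infty, so the series of bounds is summable.
```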

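Thus the kernel of the operator $K\exp(\Delta)$ is in $\mathcal{S}(\mathbb{R}^{2n})'$. Application of $\mathcal{F}_2$ shows that $E^1_{r,2}\mathcal{F}_2 k \in \mathcal{S}(\mathbb{R}^{2n})'$.

By Theorem 5, there exist $M_\alpha$ in $\mathcal{S}(\mathbb{R}^n)'$ such that

\[ k(x,y) = \sum_{|\alpha| \le N} M_\alpha(x)\,(x-y)^\alpha\, p_1(x,y) \qquad \forall x, y \in \mathbb{R}^n. \]

But $|k| \le p_1$, so $N = 0$ and $M_0 = m$ for some $m$ in $L^\infty(\mathbb{R}^n)$. □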